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On  the  Stability  of  Bayes  Estimators  for  Gaussian  Processes 

by 

Ian  W.  McKeague 


Abstract 

We  consider  the  Bayes  estimator  do  for  a  Gaussian  signal  process 
observed  in  the  presence  of  additive  Gaussian  noise  under  contamination 
of  the  signal  or  noise  by  QN-laws,  introduced  by  Gualtierotti  (1979). 

Upper  bounds  on  the  increase  in  the  mean  square  error  of  60  over  the 
minimum  possible  mean  square  error  under  contaminated  noise  or  contaminated 
signal  are  given.  It  is  shown  that  the  performance  of  do  is  relatively 
close  to  optimal  for  small  amounts  of  contamination. 


1.  Introduction. 
1  ■  ■  ■  ■ 


The  Bayesian  approach  to  the  robust  estimation  of  a  signal  in  the 
presence  of  noise  has  been  studied  extensively  in  recent  years.  Some 
authors,  including  Blum  and  Rosenblatt  (1967),  Solomon  (1972),  Watson 
(1974)  and  Berger  (1982)  have  discussed  procedures  which  can  be  used 
when  only  vague  information  concerning  the  prior  distribution  is  avail¬ 
able.  Others,  including  Box  and  Tiao  (1968),  Masreliez  (1975)  and 
Ershov  and  Liptser  (1978)  have  constructed  estimators  which  are  robust 
with  respect  to  contamination  of  the  noise  distribution. 

The  purpose  of  the  present  article  is  to  study  the  performance  of 
the  usual  Bayes  estimator  (denoted  £«)  for  Gaussian  prior  and  additive 
Gaussian  noise  under  certain  deviations  from  normality  in  either  the 
prior  or  the  noise  distribution.  It  is  shown  that  the  performance  of 
So  is  relatively  close  to  optimal  for  small  amounts  of  contamination.  The 
main  results  of  the  paper  give  upper  bounds  on  the  increase  in  the  mean 
square  error  of  60  over  the  minimum  possible  mean  square  error  under  a 
specific  contaminated  prior  or  contaminated  noise  distribution.  These 
results  make  it  possible  to  assess  the  loss  caused  by  the  use  of  6»  under 
non-Gaussian  conditions.  The  contaminated  Gaussian  laws  used  in  this 
paper  are  QN-laws  (quasi-noise  laws)  which  were  introduced  by  Gualtierotti 
(1979).  QN-laws  form  an  appropriate  class  of  contaminated  Gaussian  laws 
for  some  infinite  dimensional  models  arising  in  communication  theory  (see 
Gualtierotti,  1980).  Gualtierotti  (1982)  recently  studied  the  stability 
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of  signal  detection  under  mixtures  of  Gaussian  laws  as  well  as  QN-laws. 
Contamination  by  Gaussian  mixtures  was  shown  to  lead  to  worse  behavior 
than  contamination  by  QN-laws.  In  the  present  paper  attention  is 
restricted  to  contamination  by  QN-laws. 

Section  2  contains  some  preliminary  material  on  measures  on  locally 
convex  spaces  and  a  derivation  of  the  Bayes  estimator  for  Gaussian  prior 
and  Gaussian  noise  on  infinite  dimensional  spaces.  Section  3  contains  a 
discussion  of  QN-laws  defined  on  locally  convex  spaces  and  a  description 
of  the  posterior  distribution  when  the  prior  or  the  noise  is  a  QN-law. 
Upper  bounds  for  the  increase  in  the  mean  square  error  of  60  over  the 
minimum  possible  mean  square  error  under  a  QN-law  prior  or  QN-law  noise 
are  given  in  Section  4.  Some  exaag>les,  including  an  application  to 
Kalman  filtering,  are  discussed  at  the  end  of  the  paper. 

2.  Preliminaries. 


Let  (S,S)  and  (T,T)  be  measurable  spaces,  a  probability  measure 
on  S  x  T,  ux  and  Uy  the  projections  of  The  conditional  distribution 

ux|y,  if  it  exists,  is  defined  to  be  a  probability  measure  on  S  for  a.e. 
dVy(y)  such  that  (A)  is  measurable  as  a  function  of  y  for  each 

fixed  A  e  S  and 

y^  (A  x  B)  s  JB  ux|y(A)dUy(y)  for  all  A  e  S  and  B  e  T. 

It  follows  from  the  definition  that  yxjy  «  px  a.e.  diiy(y).  The  following 
lemma,  which  is  proved  using  Fi&ini's  theorem,  states  the  abstract  Bayes 


formula  of  Kallianpur  and  Striebel  (1968). 


exists  and 


Leaaa  2.1.  Suppose  that  the  conditional  distribution 

dMYlx 

-  fmm _ N  I _ *  I  *  *r _ «  «  _ 


the  nap  (x,y) 


(y)  is  S  x  T  measurable.  Then  the  conditional 


distribution  y^  exists  and 


^Elr  (x, .  *£ls  w 

dyv  1  }  dy.,  a.e. 


dy^UyCx.y) 


The  probability  measure  y^  is  to  be  defined  through  a  prior  dis¬ 
tribution  ux  on  S  and  a  noise  distribution  y^  on  T.  S  is  the 
parameter  space  and  T  is  the  observation  space.  Let  f:S  *T  +  T  be 
an  S  x  T/T  measurable  map.  Define  y^  by  y^  (A)  * 
yx  9  Hjj  {(x,y)  :  (x,f(x,y))  e  A),  It  is  easily  seen  that  yy|x  exists 
and  is  equal  to  y^f”1  where  f  :  T  T  is  defined  by  fx(y)=f(x,y) . 
When  uxjy  exists  it  is  called  the  posterior  distribution. 

Before  going  further  we  need  to  make  a  brief  detour  through  the 
theory  of  probability  measures  on  topological  vector  spaces.  Let  E 
denote  a  locally  convex  topological  vector  space  with  topological  dual 
E'.  The  cylindrical  o-algebra  on  E  is  the  o-algebra  generated  by  E' 
and  is  denoted  o(E').  Let  y  be  a  probability  measure  on  o(E')  such 

x 

v. 

that  /g  <  f,x>2  dy(x)  <  for  all  f  in  E'.  Then  y  has  a  mean  m 
and  a  covariance  operator  R  and  under  mild  conditions  m  belongs  to 
E  and  R  maps  E'  into  E  (See  Vakhania  and  Tarieladze,  1978). 

Schwartz  (1964)  showed  that  if  E  is  quasi-cosplete  then  each  covariance 
operator  R:E'  E  has  a  unique  Hilbert  space  H,  which  is  a  vector  subspace 
of  E,  such  that  the  natural  injection  j  of  H  into  E  is  continuous 
and  R  *  j j*.  The  Hilbert  space  H  is  called  the  reproducing  kernel 
Hilbert  space  (RKHS)  of  R.  If  the  RKHS  is  separable  with  a  CONS 
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( en ,  n  i  1}  then  the  covariance  operator  admits  a  series  representation 

R  *  J  je^  0  jen<  where  (u9u)(f)  *  <f,u>u,  for  ueE.feE',  and  the 
n 

series  converges  to  R  in  the  strong  operator  topology: 

j®n  •»  Rf  in  E  for  all  f  in  E'.  A  probability  measure 
u  on  o(E')  is  Gaussian  if  each  f  in  E*  is  a  Gaussian  random 
variable  under  p.  The  methods  used  in  this  paper  depend  on  the  existence 
of  a  separable  RKHS  for  the  covariance  operators  of  Gaussian  measures. 

For  this  reason,  we  assume  throughout  that  E  is  quasi* complete  and  each 
Gaussian  measure  u  has  a  mean  m  e  E,  a  covariance  operator  R:E*  -*■  E 
and  a  separable  RKHS.  Such  a  Gaussian  measure  is  specified  by  u  *  N(m,R). 

Now  assume  that  »  NCO.R^)  on  cr(E')  with  RKHS  denoted  H^  and 
injection  E,  vx  «  NC^Rg)  on  oO^),  (S,S)  *  (H^ofly), 

(T,T)  ■  (E,o(E'))  and  f(x,y)  «  j^(x)  ♦  y.  Let  L N  denote  the  closure  of 
E'  in  L2(E,un),  -►  Hj^  the  unitary  operator  defined  by 

V  *  j*f,  for  f  in  E*.  Rx  is  a  trace-class  operator  on  so  it 
has  a  series  representation  Ry  3  Tn  ®n#®n»  w*ieie  is  a 

CONS  in  H^,  2:  0  and  tr(R^)  3  I  Tn  <  “*  1  denotes  the  identity 


operator  on  H^. 

The  following  proposition,  well  known  for  finite  dimensional  spaces, 
gives  the  posterior  distribution  for  Gaussian  prior  y^  and 

Gaussian  noise  i^. 


Proposition  2.2.  Let  *  NCO^),  yx  *  NCm^.Rj^).  Then  the  posterior 
distribution  yx|y  exists  as  a  probability  measure  on  o^)  and  is 
given  by  yxjy  *  NCm^,  R^),  where 

My  *  I  1  M  *  <^-  >  V  h\y  *  V1  *  V'1- 


Proof-  Denote  [t^X(en)](y)  by  c*n(y).  The  an  are  i.i.d.  N(0,1) 
random  variables  under  y^  so  that  “yjy €  a*e*  dy^Cy).  But, 

V1^1  *  ‘'n  for  each  x  €  ^  *cf*  19*2,  Theorem  2.1)  so 

that  by  Baker  (1976)  Uy  *  ly  Thus  a^y  t  ^  a.e.  dy^y)  and  the 
pair  (myjy,!^^)  defines  a  Gaussian  measure  on  0(1^)  a.e.  dwyty). 

Now  check  the  conditions  of  Leona  2.1.  ^y|x  exists  and  is  equal  to 

WN*^'  T*ie  "“P  (*•/)  *— "*  d*V|x/dVy)  *s  0<V  *  o(E’)  measurable 

since 


duri»f  , .  dv  . ,  % , , 

“357“(y)  3^  w 

■  exp  !tUN1(x)l  -  %  II  *  112  > 

dl*N 

*  dj^y)  exp  l  {an(y)  "V**  *  *  <en»x>t)  ' 

where  the  Radon-Nikodym  derivative  dy^f”*  / dy^  is  given  in  McKeague 
(1982,  Theorem  2.1),  for  instance.  Now  applying  Lemma  2.1,  the  character* 
istic  functional  yx|y(u)  ■  J^e  i<u,x>dyx|y(x) ,  for  u  e  H^,  as  a  function 

of  u,  is  proportional  to  lim  Z^(x)dyx(x),  where 

k 

Z^x)  -  exp  l  (i<en,uxen,x>  ♦  «n(y)<en,x>  -  .%  <en,x>2}. 
n»l 


Provided  that  {Z^,k  36  O  is  uniformly  integrable,  the  result  now  follows 

from  routine  calculations  since  the  <en,x>,  n  2  1  are  independent 

N(<e  ,mv>,  t  )  random  variables  under  yv.  But 
n  a  n  a 
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£  ex?  *2  X  an  (y)<on»x>)dwx(x) 
k 

*  exp  {2  l  (a 2(y)t  ♦  a  (y)<e  ,m„>  )>, 
n=  1  n  n  n  n  a 

tdiich  shows  that  {Z^,k  2s  1}  is  a.e.  duy(y)  uniformly  integrable  with 
respect  to  y^,  as  required. 

3.  QN- Laws . 

Let  Ej  and  E2  be  locally  convex  spaces.  Suppose  that  y  =  N(m,R) 
on  o(Ej)  with  RKHS  denoted  H  and  injection  j  :  H  E^;  also  let 
A  :  Ej  -*•  Ej  be^a  symmetric  non-negative  operator,  o  e  R,  a  e  E2  and 
J  :  Ej  -*■  E2  be  a  continuous  linear  map.  Provided 

c  1  =  / c  (<*2  ♦  <A(J(x)-a),  J(x)-a>)dy(x)  <®, 

1 

define  a  probability  measure  v  on  o(Ej)  by  v  *  y  if  c"1  «  0, 
otherwise  by  the  relation 

J.. 

<^CX)  =  c(a2  ♦  <A(J(x)-a) ,  J(x)-a>). 

The  measure  v  is  called  a  QN-law  and  was  introduced  on  Hilbert  space  by 
Gualtierotti  (1979).  If  J*AJ  has  a  separable  RKHS  then  c_1<»  if  and 
only  if  j*J*AJj  is  trace- class,  and  in  this  case 

c"1  »  o2  ♦  tr(j*J*AJj)  ♦  <A(J(m)-a),  J(m)-a>. 

It  is  always  possible  to  assume  that  a  is  either  zero  or  one.  We  shall 
assume  that  a  *  1  and  write  v  »  QN((J,a,A) ,y) .  When  Ej  »  E2  and  J 
is  the  identity  map  write  v  «  QN((a,A),y ).  Gualtierotti  (1980)  calculated 
the  mean  and  covariance  operator  of  v  for  the  case  of  a  separable  Hilbert 
space.  It  is  possible  to  extend  this  result  to  separable  Banach  spaces  as 


follows. 
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Lemma  3.1.  Suppose  that  is  a  separable  Banach  space  and  J*AJ  has 

a  separable  RKHS.  Then  the  mean  and  covariance  operator  R^  of  v 
are  given  by 

Q 

*  a  ♦  u 

RQ  *  R  ♦  2cRJ*AJR  -  u0u, 
where  u  *  2cRJ*A(J(m)-a)  . 

Proof.  (Sketch)  Assume  that  m  ■  0  and  consider  just  the  evaluation  of  R  . 

Let  J*AJ  »  g^  e  E*.  Then,  for  f  e  E*, 

n 

/E  <f,x>2<  AJ(x),J(x)  >  dn(x)  =  I  /£  <f,x>2  <g.,x>2  dy(x), 

1  n  1 

so  that  we  can  reduce  to  evaluating  integrals  of  the  form 

/_  <f,x>2  <g,x>2  du(x) .  Choose  h  «  El  such  that  j*(h  ),  ni  1  is  a 

n  i  n 

CONS  for  H.  Define 

k 

V  '  J,  'hn"x>  “V  *  1  E1‘ 

n=l 

Then,  by  Tien  (1978,  Lemma  2),  ir^x  converges  a.s,  [y]  to  x.  But 


^Ej  <f,wkx>4  <g,Vjlx>*‘  dy(x)  S  {/g  <f,*kx> 

S  105  <Rf,f>2 
k 

since  <f,*.x>  is  a  N(0,  \  <Rh  ,f>2 ) 

K  n«l  n 


8  du(x)}5*  (L  <g,ir.x>8 
I  C1  K 

<Rg»g>2# 

random  variable  and 


dy(xjp 


1  <Rh  ,f>2  s  <Rf,f>.  It  follows  that  {<f,ir.x>2  <g,wkx>2  ,  k  st  1) 
n»l 

is  uniformly  integrable  and  the  Lebesgue  convergence  theorem  can  be  applied. 
The  integral  /g  <f,irkx>2  <g,i»kx>2  dy(x)  can  be  calculated  using  the  fact 
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that  <hn,x>  ,  n  a  1  is  an  i.i.d.  N(0,1)  sequence  of  random  variables 
with  respect  to  m.  E 

The  next  proposition  shows  that  the  posterior  is  a  QN-law  if  either 
the  prior  is  Gaussian  and  the  noise  is  a  QN-law  or  the  prior  is  a  QN-law 
and  the  noise  is  Gaussian.  Let  =  N(Q,R^),  ux  *  NCn^.R^)  as  in 
Section  2  and  let  uxjy  denote  the  corresponding  posterior  distribution 
given  in  Proposition  2.2. 

Proposition  3.2.  (i)  If  the  prior  is  yx  *  NCrn^iy)  and  the  noise  is 

vN  *  QNfCa.A),^)  then  the  posterior  is  vx|y  =  QN((jN,y-a,A)  .V^y)  • 

(ii)  If  the  prior  is  vx  =  QN((a,A),yx)  and  the  noise 
is  =  NCO.Rj^)  then  the  posterior  is  vxjy  =  QN((a,A)  ,wxjy)  • 

The  proof  of  this  proposition  uses  the  following  consequence  of 
Lemma  2.1. 

Lemma  3.3.  Let  and  be  probability  measures  on  S  x  T  such  that 

(a)  yx~vx  and  iywy; 

(b)  Vy|x  and  vy|x  exist  and  | xVVY | x  a'e*  dwx^’ 

(c)  the  maps  (x,y)  i — >dvyjx/  dyY|x^*  (x,y)  *— >  d^yJx/^M 
are  $  *  T  measurable.  Then  vx|y  exists,  Vx|y'x'vix|y  a,e*  duY^  and 


dv„ |  dyy  dv  I  dvx 

dU^(x)  3  7J^y)  d5^Cy)  a-e*  dwX®yY(x*y)' 


Proof.  Using  (a)  and  (b)  get 


35^  3^y)  a-e- 
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so  that,  by  (c),  the  function  (x,y)  1 — ->  d\iy|x^/  dv^(y)  is  S  x  T 
measurable  and  vx|y  exists  by  Lemma  2.1.  The  proof  is  completed  by 
applying  Bayes  formula.  □ 

Proof  of  Proposition  3.2.  (i)  since  ^ylx^ylx  ^or  x  6  Hfj* 


dv 


Y  x 


d\|: 


(y)  ■  cN(l  *  <A(y-a-jjjX)  ,y-a-jjjX>  ) 


so  that  the  map  (x,y)  i — >  dv^j^y  diiyjx(y)  is  °(HN)  x  <*(E')  measurable. 

The  map  (x,y)  i — >  djjy|x/  duy(y)  is  o(HjP  x  c(E*)  measurable  from  the 
proof  of  Proposition  2.2,  Thus,  by  Lemma  3,3  vx|y  exists  and 

d^w  '  cn(1  *  <M;Nx-(y-a)),jNx-(y-a)>), 

which  shows  that  vx|y  =  QN((jN,y-a,A) ,wx jy) .  The  proof  of  (ii)  is  similar.  □ 


4.  Bayesian  Robustness. 


Lex  6  denote  a  decision  rule  for  estimating  the  true  signal 
x  e  H^.  6  is  a  measuxable  function  from  the  observation  space  E  into 

the  parameter  space  H^.  For  prior  vx  and  noise  the  mean  square 
error  of  6  is  given  by 

r(vx,V«)  •  /^x E  II  *■«(/)  II  ^  dv^Cx.y). 

The  following  functions  of  vx  and  vN  will  be  used  to  measure  the 

robustness  of  a  decision  rule  6e:  the  increase  in  the  mean  square  error 

in  using  60  over  the  minimum  possible  mean  square  error, 

Mvx,vN,«o)  *  r(vx,vN,60)  -  inf  r(vx,vN,«), 

5 

and  the  ratio  of  the  mean  square  error  using  60  to  the  minimum  possible 
moan  square  error, 


*(vx,vn,60) 


Let  60  be  the  optimal  (in  the  mean  square  sense)  estimator  for  Gaussian 
prior  ux  *  NCnyRy)  and  Gaussian  noise  =*  NCO.R^).  Then  60(y)  =  “^jy* 
the  posterior  mean  given  in  Proposition  2.2.  The  results  of  this  section 
give  some  upper  bounds  on  A(vx>vN,60)  and  *(vx»vN,do^  for  vx  and  vN 
as  QN-law  contaminations  of  px  and  respectively.  First  we  evaluate 
the  mean  square  error  of  60  under  contaminated  prior  or  contaminated  noise 
Denote  Rj  *  R^y  *  R^U  ♦  Ry)"1. 


Lemma  4.1.  (i)  Let  vx  a  QN((a,A) ,px) .  Then 

r*vX»Va°^  *  tr(Rj)  ♦  2cxtr(AR^), 
where  c"1  «  l  ♦  trARy  ♦  <A(my-a)  ,  .-a>. 

(ii)  Let  «  QN((a,AfPfP .  Suppose  that  E  is  a  separable 
Banach  space  and  A  has  a  separable  RKHS.  Then 

r(Wio)  *  tr(Ri>  *  2cN  tr(ANRi)* 
where  ^  »  j*A*^  c^1  *  1  *  trCAjP  ♦  <Aa,a>. 

Proof,  (i)  r(vx,vN>««)  *  ^hJe  ^  ®x|y  "  dwY|x^dvX^x^  *  But 

My  '  1  *  J,  ([UN1(en»W  -  «.V  '  <-X  ?'V)en' 


so  that 


<x-mv,e  > 
X  n  , 


411  mx|y"xll  2<S|xW*  -T—  >2^|»W 

n2i  n  n  1 


2 
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since  [U^*(en)](y)  -  <en,x>  is  a  N(0,1)  random  variable  under  u  Y|x. 

By  Lemma  3.1 

^  <0n'x*V2<lvXtx)  *  Tn  *  2cXt„2<a%'V- 
so  that 

r(VH'W  '  l,  <*  '  T~  *  2cX<AVV> 

nai  n  n 

=  trfR^CI  ♦  ♦  2cxtr(ARx2(I  ♦  R^’2). 

(ii)  is  proved  in  a  similar  way.  □ 

The  following  theorem  gives  an  upper  bound  on  the  increase  in  the  mean 
square  error  of  6«  over  the  minimum  possible  mean  square  error  under  a 
contaminated  prior  distribution. 

Theorem  4.2.  Let  vx  *  QN((a,A)  ,yx) .  Then 

A(W  6o)*4cl  H R1AH 2 [trRxR1+2cxtrARj*(l*4cx||  AR^ |J )  ||  nyall 2] ,  * 

where  cj1  ■  1  ♦  tr(ARj). 

Proof.  It  is  easily  checked  that  ACv^y^fio)  *  J£ ||  m^  -  m^|y  ||2  dvy(y) . 

By  Proposition  3,2  and  Lemma  3.1,  m^|y  ■  m^  ♦  2cxjy  R^ADn^-a) ,  so  that 

A(W6#)  s  4ci  H^aH2/e  Hmxjy*all2dvY(y)- 

Now  consider 


-  12  - 


/Ell  "xiy-all^yfy)  ■  /h^/e II  ”x|y-|‘!l2dMxc>')<lvx(x)- 

/Ell  "x|y-»ll2 S|,Of)  -  <[H1Cen))(y)  -  <en,x> 

<e  ,my-a> 

♦  <en,x-.»  - - - - 

n 

t  <e^>mY-a> 

•  l  (T7T-)2  a  ♦  {<en,x-a>  ♦  — }*). 

nil  n  n  Tn 


Use  Lemma  3.1  to  get 

<e  ,mY-a> 

V*'"’*'*’  *  — — )2*,x(’°  * 

1+T  1+T 

Tn+2cX<RXARXen,en>  *  *X(-T~)<VRXA(Vl)><V  V**  *  • 

n  n 

This  yields 

/Ell  My-all2  ^yCrt  *  r|i  .  ^x'T^^VScVV 

♦  ‘crtl(1,V'1VV‘><tAa>  ‘  ‘W**2’ 

S  trRy2  (I+Rj^)”1  ♦  2cx  trAR^CI+R^"2  ♦  4cx||  AK*  (I*RX)“1J|  ||  n^-ajp 


♦  II  va«P  * 

and  the  result  follows.  LI 

It  is  now  possible  to  give  an  upper  bound  on  tCv^u^do),  end  since 
we  are  mainly  interested  in  the  effects  of  small  amounts  of  contamination, 
we  state  it  in  the  following  form. 


I 


i 
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Corollary  4.3,  Let  vx  »  QN((a,eA) ,yx) ,  where  e>0.  Then 
4H  RXA||2  [tr(R^R  )  ♦  ||  mx-a||2  ] 

*(vx»VN,«.)  *  1  ♦  - £ - - - -  (1  ♦  o(l))e2,  as  e->0. 

In  particular,  ♦(vx,»jn,«.)  *  1  ♦  0(e2),  e-K). 

Proof.  The  result  follows  Aron  Proposition  4.1,  Theorem  4.2  and  the  identity 

"  1  +  ^,1^,6.)  -  A^^.do)  13 

The  next  theorem  gives  an  upper  bound  on  the  increase  in  the  mean  square 
error  of  6«  over  the  minimum  possible  mean  square  error  under  a  contaminated 
noise  distribution.  In  order  to  use  the  known  formulae  (Lemma  3.1)  for  the 
mean  and  covariance  operator  of  a  QN-law  on  E  it  is  assumed  for  the  remainder 
of  this  section  that  E  is  a  separable  Banach  space  and  A  has  a  separable 
RKHS. 


Theorem  4.4.  Let  vN  •  QN((a,A)  ,1^) .  Then 

A(yX'V6o)  *  *2*11  ItrRxRl  *  *snt,Vl1 

♦  tr  ♦  A^  ♦  2^)  ♦  (1  ♦  ||)  <ARNAa,a>), 

where  Aj^  «  jjJAjjj  and  c’1  ■  1  ♦  trCAgRj). 


Proof.  By  Proposition  3.2,  vxjy  ■  QN((JN,y-a,A) »Px|y) » 
4\y  ■  “xly  *  *x|y  *1  Jj^OnMy-J'  *  *>•  n,us 


and  by  Lemma  3. 1, 


-  14  - 


MVV«.)  -  /£||  ^|y-"2jy!,2dVy) 

s  4cj  /E  ||  R^jJACj^jy-y  ♦  «0  |2  dwyCy) 

S  8c2  [||  R^ll2  /Ell  "xJy-^xlPdV50 

♦  /Eil  Ri^NA^H*X-y  ♦  ^  II2  * 

It  is  easily  chocked  that 

JEII  ®x|y-mxll2dVy)  "  tr(W  4  2cNtr(ANRl)* 

Note  that  ♦  u  and  Ry  ■  ^  ♦  2cniViARN  “  , 

where  u  ■  -2^1^ (a).  Hence 

JEII  lityUfo-Y+HW1  <^Y(y) 

-  tT(R2j*AR^jM)  ♦  ||  Rjj^ACa-u)!^ 

.  -  trCR^Vn  ♦  Aj  ♦  20^  -  II  Rjj^u)!!2 

♦  ||  RjjjJACa-u)  ||2 

-  trCR^CyiyAj,  ♦  A2  ♦  2cnaJ  ♦  ||  R^gAC*)!!2 

♦  4c^«R1jj5A(a),R1ANj*A(a)> 

S  trCRjCAjjRj^  ♦  A^  ♦  2CjjA^)  ♦  (1  ♦  4(^11  Aj^ll  )<ARIJAa,a>. 

The  result  follows  immediately.  □ 


Corollary  4.5.  Let  vN  •  QN((a,eA)  ,»!„),  where  e>0.  Then 

8(||  RjAflll2  trVl  ♦  tr^Cy^.**)  *  <^.*>1 _ . 

♦(VV8*1  *  1  * - — - tr(Rp - (l*oC1)>^' 

as  e**0.  In  particular,  *(u^»vN,40)  *  1  ♦  0(e2),  c-*0. 


Exapples 


1.  The  one-dimensional  oaae  with  contaminated  prior .  Let  X  and  N  be 

independent  random  variables  with  distributions  vx  ■  QN( (a^.c) ,ux)  and 
2  2 

Vjj  »  N(0,on)  respectively,  where  yx  ■  Then  Y  ■  X  ♦  N,  A  «  eo, 

2  _ 
rN  =  p 

E_ 
in 
6 

2.  The  one-dimeneional  ease  with  contaminated  noise.  Let  X  and  N  be 

independent  random  variables  with  distributions  ux  ■ 

2  2 

vN  *  QNCCO.e),^)  respectively,  where  y^  »  N(0,oN) .  Then  *  eoN, 

*  p  and  by  Corollary  4.S 

irf(E(x!«c$z  s  1  *  *V11  *  (T^)2,(1*°(1))£2' 

« 

3.  Kalman  filtering  in  the  presence  of  contamination.  Let  the  signal 
process  Xt  and  the  observation  process  Yt  be  given  by  the  stochastic 
differential  equations 

dXt  «  -BXtdt  ♦  dwj 

and  dYt  -  Xtdt  ♦  dW* 

1  2 

(OStsi),  where  W  and  N  are  independent  Wiener  processes,  0>O,  and 

l  12 

X«  is  a  N(0,ig)  random  variable  which  is  independent  of  W  and  W  . 

Then  E  -  C[0,1J,  -  L*[0,1],  JN  :  ^  •+■  E  is  defined  by 

jN(f)(t)  »  Jjf(s)ds,  for  feH^,  tc [0,1],  Pjj  is  the  integral  operator 
with  kernel  min  (s,t)  and  is  the  integral  operator  on  L2[0,1]  with 
kernel  ioo"^8"^.  can  be  expressed  as  the  solution  of  a  stochastic 


N(V°x> 


and 


,  the  signal  to  noise  ratio. 

>2 


By  Corollary  4.3 

W  ‘ 1  * 


4oJp 


and 


differential  equation  for  the  interpolation  of  a  Gaussian  process  (see 
Liptser  and  Shiryayev,  1978). 

a)  Contaminated  signal.  Let  A  be  the  identity  operator  on  L2[0,1] 
and  let  »  QN((0,eA),ux),  where  *  N(0,B;x).  By  Corollary  4.3 

s  1  ♦  4  trCR^JCl+oClJJe2.  But  trCR^  »  1/28  so  that 

*  1  ♦  |<l+o(l))e2. 

b)  Contaminated  noise.  Let  A  be  the  natural  injection  of  C[0,1]  into 

C*[0,1]  and  let  •  QN((0,cA) ,yw) ,  where  y w  is  Wiener  measure  on  C[0,1]. 

*2 

Thus  dwNydpw(x)  *  cN(l  ♦  e/q x^dt) ,  where  c^  is  a  constant.  By  Corollary 
4.5,  *(Ux,vn,6#)  s  1  ♦  24  trCl^) (trAN)2(l+o(l))e2.  But  Aj^  is  the  integral 
operator  on  L2[0,1]  with  kernel  min  (s,t).  Thus  trCA^)  *  j  and  it 
follows  that 

Si*  j(l*o(l))e2. 
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